Gaussian Process Regression with Mismatched Models

نویسنده

  • Peter Sollich
چکیده

Learning curves for Gaussian process regression are well understood when the 'student' model happens to match the 'teacher' (true data generation process). I derive approximations to the learning curves for the more generic case of mismatched models, and find very rich behaviour: For large input space dimensionality, where the results become exact, there are universal (student-independent) plateaux in the learning curve, with transitions in between that can exhibit arbitrarily many over-fitting maxima; over-fitting can occur even if the student estimates the teacher noise level correctly. In lower dimensions, plateaux also appear, and the learning curve remains dependent on the mismatch between student and teacher even in the asymptotic limit of a large number of training examples. Learning with excessively strong smoothness assumptions can be particularly dangerous: For example, a student with a standard radial basis function covariance function will learn a rougher teacher function only logarithmically slowly. All predictions are confirmed by simulations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UNDERSTANDING BEHAVIOR OF ANTINEOPLASTIC MOLECULES BASED ON MLR MODELS

New statistic based models provide a wide area of prediction equipments for different science areas. Among these fields biology have just entered the contest of interdisciplinary sciences. Drug discovery is a long and expensive process which could be decreased with theoretical approaches. In this study, 500 reported assayed anti cancer molecules were extracted from Science Direct articles, sket...

متن کامل

Relations between information and estimation in scalar Lévy channels

Fundamental relations between information and estimation have been established in the literature for the discrete-time Gaussian and Poisson channels. In this work, we demonstrate that such relations hold for a much larger class of observation models. We introduce the natural family of discrete-time Lévy channels where the distribution of the output conditioned on the input is infinitely divisib...

متن کامل

A Geometric View on Constrained M -Estimators

We study the estimation error of constrained M -estimators, and derive explicit upper bounds on the expected estimation error determined by the Gaussian width of the constraint set. Both of the cases where the true parameter is on the boundary of the constraint set (matched constraint), and where the true parameter is strictly in the constraint set (mismatched constraint) are considered. For bo...

متن کامل

Gaussian Process Regression with Censored Data Using Expectation Propagation

Censoring is a typical problem of data gathering and recording. Specialized techniques are needed to deal with censored (regression) data. Gaussian processes are Bayesian nonparametric models that provide state-of-the-art performance in regression tasks. In this paper we propose an extension of Gaussian process regression models to data in which some observations are subject to censoring. Since...

متن کامل

Prediction of the main caving span in longwall mining using fuzzy MCDM technique and statistical method

Immediate roof caving in longwall mining is a complex dynamic process, and it is the core of numerous issues and challenges in this method. Hence, a reliable prediction of the strata behavior and its caving potential is imperative in the planning stage of a longwall project. The span of the main caving is the quantitative criterion that represents cavability. In this paper, two approaches are p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001